CDS
Accession Number | TCMCG036C21115 |
gbkey | CDS |
Protein Id | PTQ29715.1 |
Location | complement(join(2554..2768,2940..3085,3283..3456,3570..3669,3796..3951,4178..4286,4462..4679,4921..5041,5410..5497,5674..5743,5859..5979,6183..6350,6514..6699,6980..7066,7160..7302,7531..7699,8094..8189,8314..8396,8574..8691,8844..8981,9194..9262,9393..9617)) |
GeneID | Phytozome:Mapoly0135s0001 |
Organism | Marchantia polymorpha |
locus_tag | MARPO_0135s0001 |
Protein
Length | 999aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA53523, BioSample:SAMN00769973 |
db_source | KZ772807.1 |
Definition | hypothetical protein MARPO_0135s0001 [Marchantia polymorpha] |
Locus_tag | MARPO_0135s0001 |
EGGNOG-MAPPER Annotation
COG_category | G |
Description | Belongs to the glycosyl hydrolase 31 family |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction |
R00028
[VIEW IN KEGG] R00801 [VIEW IN KEGG] R00802 [VIEW IN KEGG] R06087 [VIEW IN KEGG] R06088 [VIEW IN KEGG] |
KEGG_rclass |
RC00028
[VIEW IN KEGG] RC00049 [VIEW IN KEGG] RC00077 [VIEW IN KEGG] |
BRITE |
ko00000
[VIEW IN KEGG] ko00001 [VIEW IN KEGG] ko01000 [VIEW IN KEGG] |
KEGG_ko |
ko:K01187
[VIEW IN KEGG] |
EC |
3.2.1.20
[VIEW IN KEGG]
[VIEW IN INGREDIENT] |
KEGG_Pathway |
ko00052
[VIEW IN KEGG] ko00500 [VIEW IN KEGG] ko01100 [VIEW IN KEGG] map00052 [VIEW IN KEGG] map00500 [VIEW IN KEGG] map01100 [VIEW IN KEGG] |
GOs | - |
Sequence
CDS: ATGCCCCCGGCAAAGAAGATGCTATGGGCGCCCATAATTCAAGAAGGTGTCTTCCGGTTCGATGCCAATGAGGGGGCAAAAAAGCAAGCCTGGCCCTCTGTGTCCTTCGTAAATGGGCAGGATCGAGAGTCGCCCATCACGTTGACAGCCGAGGCCCACATTCGGGAGCCCCTCTACATCCCTCAGTGCAGGACGGACAATGGCTCACAGACTATCACTGTGAAACTGCCTGAAGGCACTTCATTCTATGGTACTGGTGAAGTTAGTGGGCCCCTTGAGAGAACCGGGAAAAGGGTCTTCGCTTGGAACACTGATGCATGGGGTTATGGGCCCAGCACAACAGCATTGTATCAGTCACATCCTTGGGTTCTTGCGCTCCTTCCAGATGGAACTACTTTCGGCGTTTTAGCTGACACCACTCGCCGAGCTGAGATCGATACCCGAAAGGCGTCTACCATCCGTTTTGTAGCATCTGGATCGTATCCTGTCATAACATTTGGACCGTTTTCGTCCCCTGAAGCAGTCCTGACTGCTCTCTCCAAAGCTACAGGAACTTTGGCAATGCCGCCAAAGTGGACGCTTGGGTATCAACAGTGCAGGTGGAGTTATGAAACTGCAGACCGAGTTGTAGAGATTGCCACAACATTTCGAGAAAAGAAGATACCTTGTGACGTGATATGGATGGACATTGATTACATGAATGCTTGGCGTTGTTTTACGTTCGACCCTGAAACGTTTCCTGAACCAGCCAAGCTCTCCGATTTGCTGCATGAGAAGGGTTTCAAGGGTGTGTGGATGCTTGATCCTGGCATCAAGCAAGAACCAGGCTGGTCTGTATATGACTCTGGAACGGCTGAGGATGTTTGGGTCCTTCAGGCGAACAAGAAACCTTACGCTGGTGAAGTGTGGCCTGGTCCATGTTGTTTCCCAGATTACACTCAAGCAAAAACTAGGAAATGGTGGGGAGGGCTGGTAAAAGACTTTGTCAAGATCGGTGTCGATGGGATTTGGAATGACATGAATGAACCTGCAGTCTTTAAGAGTTTGTCAAAGACGATGCCTGATACAAACATTCATAGAGGAGACGAAGATCTCGGTGGAAGCCAAAATCACCAGCACTACCACAATGTTTATGGAATGTTGATGGCTAGATCAACCTACGAAGGGATGATCCTTGCTAATCCTGAGAAGAGGCCATTTGTGTTGACCAGAGCAGGTCATGTTGGAAGCCAAAGATACGCTGCCACATGGACCGGTGACAATCTTTCAAGTTGGATTCATCTTGAAATGAGTATACCAATGTCACTAAACTTGGGTTTAAGTGGACAACCATTCTCCGGGCCTGATATTGGTGGGTTTGGTGGAAACGCTACACCTCAAATGTTTGCGCGGTGGATGGGTATTGGGGCCATGCTTCCATTTGCACGTGGCCATTCAGAGAAAGGGACTGTGGATCATGAACCCTGGGAATTTGGGAAAGAGTGTGAAGACGTCTGCAGGCAGGCACTGTACAGGAGATATCGGATACTGCCGCACCTATATACTCTATTTTACAAGGCACATACGACAGGAGTTCCGATAATGTCTCCTTTGTTCTTTGCTGATCCTAAAGATGAGAAACTTCGTAAGGTGGAAGACAGTTTTCTTCTGGGACCTCTCCTTGTTGCTGCATGTACTAAGGCTGGCAAGAAACCTGATCCAAAGAAGACTGTCCTACCTAGTGGGTTGTGGCAGCTCTTTGATTTTGATGACTCACATCCGGATTTACCGTTACTGTTCCTCAAGGGAGGATCTATCATCCCGACTGGTCAGGTTTCACAGAACACTGGTGATGTCAGTGAGAATGATCCCATCACCCTCATCATAGCTCTTGATGAAGAAGGAAAAGCAGAAGGGACACTCTACGAGGATGATGGAGATAGTTTTGAATATAAAAAAGGACAGTTCTTGCTTACTCGTTACTCTGCTGCCCTGGTCTCCAGCTCCACCGGAGGTTCAAGTGGAAAAAAGATCGTCATCAAAATCACCCAGTCGGATGGTTATCTTAGCAGGCCCAAGCGTCCTCTCAAAGTGCGCATTTTGCTCGCAAATAAAGCGGAGCTTGAGGGTGAAGGTGTCGATGGCGAGGAATTGACTGTCGACCTGCCTTCACGTTTCGACATGGCCCAGATCACGACTCTCATCCAACAGCAGGATCTGCCAAAGGAAGATGATGAGAATATTCCAGATGAAGCCGATCACGAGACCTCAGAGAGCCTTGCAGTTCCTCCAACGACTCTCCTGGATCTGAAAATTGGTGACTGGTCTCTAAAAGTGGCGCCCTGGATTGGTGGTCGGATTGTTTCTATGATCCACGAGCCCACAGAGACCGAATGGCTGGAAGGAAAACTGGAGCATGGAGCGTATGAAGAGTACAGTGGGGTCGAATATCGATCTCCTGGCTGTGTGGAGCAGTATGATGTCAAAAAGGAGCAGTCAGATATTGAAGGAACAGACGGAGTAGTTATGGAAGGTGACATTGGTGGAGGATTGGTGATGTGTCGTCAAATTGGCGGCAAGGGAGCTGATTCGAAAATAGTACAGATAAGCTCCTCAATTGAGGCTAGATCTGTTGGTGCTGGTTCAGGTGGATTTTCTAGGTTGGTGTGTTTGCGGGTGCATCCATCTTTCAAAGTTGCCAACTATGAACTGGCATTGGTGAAGTTCACATCCATCAGTGGAGAAGCTCGAGAAATAGTCCCGAAGCCCGGCGATATAATGTTGACAGCAGACGACCGACCAAATGGAGAGTGGGCATTCTTAGACAAAGAGAACGGAGTGGCCATCGTGAACAGATTCAACCCAGAGCAGGTGTTCACGTGTGTAATCCACTGGTCAGCAGGCATTTGCAACTTGGAGCTTTGGTCTGAGGAGAGGCCCGTTTCCAAGGAAACACCTCTTCAGATATGCCATGAATATGAGACTGTCAGTGAGCAGGAGCTGCTTCAGTCTCAGCCTTGA |
Protein: MPPAKKMLWAPIIQEGVFRFDANEGAKKQAWPSVSFVNGQDRESPITLTAEAHIREPLYIPQCRTDNGSQTITVKLPEGTSFYGTGEVSGPLERTGKRVFAWNTDAWGYGPSTTALYQSHPWVLALLPDGTTFGVLADTTRRAEIDTRKASTIRFVASGSYPVITFGPFSSPEAVLTALSKATGTLAMPPKWTLGYQQCRWSYETADRVVEIATTFREKKIPCDVIWMDIDYMNAWRCFTFDPETFPEPAKLSDLLHEKGFKGVWMLDPGIKQEPGWSVYDSGTAEDVWVLQANKKPYAGEVWPGPCCFPDYTQAKTRKWWGGLVKDFVKIGVDGIWNDMNEPAVFKSLSKTMPDTNIHRGDEDLGGSQNHQHYHNVYGMLMARSTYEGMILANPEKRPFVLTRAGHVGSQRYAATWTGDNLSSWIHLEMSIPMSLNLGLSGQPFSGPDIGGFGGNATPQMFARWMGIGAMLPFARGHSEKGTVDHEPWEFGKECEDVCRQALYRRYRILPHLYTLFYKAHTTGVPIMSPLFFADPKDEKLRKVEDSFLLGPLLVAACTKAGKKPDPKKTVLPSGLWQLFDFDDSHPDLPLLFLKGGSIIPTGQVSQNTGDVSENDPITLIIALDEEGKAEGTLYEDDGDSFEYKKGQFLLTRYSAALVSSSTGGSSGKKIVIKITQSDGYLSRPKRPLKVRILLANKAELEGEGVDGEELTVDLPSRFDMAQITTLIQQQDLPKEDDENIPDEADHETSESLAVPPTTLLDLKIGDWSLKVAPWIGGRIVSMIHEPTETEWLEGKLEHGAYEEYSGVEYRSPGCVEQYDVKKEQSDIEGTDGVVMEGDIGGGLVMCRQIGGKGADSKIVQISSSIEARSVGAGSGGFSRLVCLRVHPSFKVANYELALVKFTSISGEAREIVPKPGDIMLTADDRPNGEWAFLDKENGVAIVNRFNPEQVFTCVIHWSAGICNLELWSEERPVSKETPLQICHEYETVSEQELLQSQP |